Incremental Task Modification via Corrective Demonstrations
نویسندگان
چکیده
In realistic environments, fully specifying a task model such that a robot can perform a task in all situations is impractical. In this work, we present Incremental Task Modification via Corrective Demonstrations (ITMCD), a novel algorithm that allows a robot to update a learned model by making use of corrective demonstrations from an end-user in its environment. We propose three different types of model updates that make structural changes to a finite state automaton (FSA) representation of the task by first converting the FSA into a state transition auto-regressive hidden Markov model (STARHMM). The STARHMM’s probabilistic properties are then used to perform approximate Bayesian model selection to choose the best model update, if any. We evaluate ITMCD Model Selection in a simulated block sorting domain and the full algorithm on a real-world pouring task. The simulation results show our approach can choose new task models that sufficiently incorporate new demonstrations while remaining as simple as possible. The results from the pouring task show that ITMCD performs well when the modeled segments of the corrective demonstrations closely comply with the original task model.
منابع مشابه
Incremental Acquisition of Task Knowledge
Learning tasks from human demonstration is a core feature for household service robots. To increase the utility of future robot servants, the robot should go beyond simply imitating the user’s behavior but try to build flexible, extensible and general task knowledge. This knowledge should at the same time encode the constraints of a task while leaving as much flexibility for optimized reproduct...
متن کاملTowards robot incremental learning constraints from comparative demonstration
This paper presents an attempt on incremental robot learning from demonstration. Based on previously learnt knowledge about a task in simpler situations, a robot learns to fulfill the same task properly in a more complicated situation through analyzing comparative demonstrations and extracting new knowledge, especially the constraints that the task in the new situation imposes on the robot’s be...
متن کاملInteractive Policy Learning through Confidence-Based Autonomy
We present Confidence-Based Autonomy (CBA), an interactive algorithm for policy learning from demonstration. The CBA algorithm consists of two components which take advantage of the complementary abilities of humans and computer agents. The first component, Confident Execution, enables the agent to identify states in which demonstration is required, to request a demonstration from the human tea...
متن کاملComparison of the effects of Written Corrective Feedback and Task-complexity Manipulation on the Grammatical Accuracy of EFL learners’ Writing
This study compared the effects of teacher-provided direct and indirect written corrective feedback and manipulation of resource-directing dimensions of task cognitive complexity along +/- Here and Now condition on the grammatical accuracy of Iranian intermediate EFL learners’ narrative writing tasks. There were 45 participants in the study who were randomly assigned to three experimental group...
متن کاملIncome Taxation and Optimal Government Policy
Various economic literatures address the question whether first-best prescriptions for government policy require modification because redistributive income taxation distorts labor supply and cannot achieve the distributive ideal. Perhaps second-best rules for public goods provision, corrective taxation, public sector pricing, and other government activity should reflect concerns about distribut...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2018